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Introduction 



The last decade has seen increasing interest in methods of functional data analysis which offer novel 
and effective tools for dealing with problems where curves can naturally be viewed as data objects. 



The books by Ramsay & Silverman (2005 1 and Ramsay et al. (2009) offer comprehensive introductions 



to the subject, the collection Ferraty & Romain (2011) reviews some recent developments focusing 



on advances in the relevant theory, while the monographs of Bosq (20001, Ferraty Sz Vieu (2006) 



and Horvath & Kokoszka (2011+) develop the field in several important directions. Despite the 



emergence of many alternative ways of looking at functional data, and many dimension reduction 
approaches, the functional principal components (FPC's) still remain the most important starting 
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point for many functional data analysis procedures, Reiss & Ogden (20071, Gervini (2008), Yao &; 



Miiller (2010), Gabrys et al. (2010) are just a handful of illustrative references. The FPC's are the 



eigenfunctions of the covariance operator. This paper focuses on testing if the covariance operators 
of two functional samples are equal. By the Karhunen-Loeve expansion, this is equivalent to testing 



if both samples have the same set of FPC's. Benko et al. (2009) developed bootstrap procedures for 



testing the equality of specific FPC's. Panaretos et al. (2010) proposed a test of the type we consider 



but assuming that the curves have a Gaussian distribution. The main result of Panaretos et al. (2010| 
follows as a corollary of our more general approach (Theorem [2]). 

Despite their importance, two sample problems for functional data received relatively little attention. 



In addition to the work of Benko et al. (2009) and Panaretos et al. (2010), the relevant references are 



Horvath et al. (2009) and Horvath et al. (2011) who focus, respectively, on the regression kernels in 



functional linear models and the mean of functional data exhibiting temporal dependence. Clearly, if 
some population parameters of two functional samples are different, estimating them using the pooled 
sample may lead to spurious conclusions. Due to the importance of the FPC's, a relatively simple 
and robust procedure for testing the equality of the covariance operators is called for. 
The remainder of this paper is organized as follows. Section |2]] sets out the notation and definitions. 
The construction of the test statistic and its asymptotic properties are developed in Section [3]] 
Section |4]] reports the results of a simulation study and illustrates the procedure by application to 
egg-laying curves of Mediterranean fruit flies. The proofs of the asymptotic results of Section [3]] are 
given in Section [5^ 



2. Preliminaries 

Let Xi, X2, . . . , Xi^ be independent, identically distributed random variables in L2[0,l] with EXi{t) = 
and cov{Xi{t) , Xi{s)) = C{t, s). We assume that another sample Xj*, . . . is also available 
and let = EX*{t) and C*{t,s) = cov{X*{t), X*{s)) for t, s G [0,1]. We wish to test the null 

hypothesis 

Hq: C = C* 

against the alternative Ha that Hq does not hold. 

A crucial assumption considering the asymptotics of our test procedure will be that 



Qn,m = ^ e G (0, 1) as N,M^ 00. 



For the construction of our test procedure we will use an estimate of the asymptotic pooled covariance 
operator of the two given samples (cf. (Q) which is defined by the kernel 

R{t, s) = ec{t, s) + (1 - e)C*{t, s). 

Denote by (Ai, ipi), (A2, ^2), ■ ■ ■ the eigenvalue/eigenfunction pairs of 9^, which are defined by 

Xkfkit) = "y^Mt) = [ Rit, s)ipk{s)ds, t G [0, 1], 1 < A; < 00 . (2) 
Jo 
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Throughout this paper we assume 

Ai > A2 > ... > Ap > Ap+i, (3) 

i.e. there exist at least p distinct (positive) eigenvalues. Under assumption (|3]), we can uniquely (up 
to signs) choose ipi,. . . ,ipp satisfying if we require \\ipi\\ = 1, where for a positive integer d and 
for xeL2 ([0,1]"') 

• • • / x'^ih, . . .,td)dti ■■■dtd] 
Jo / 

Thus, under ([3|, 1 < i < p} is an orthonormal system that can be extended to an orthonormal 
basis {(fi, 1 < i < 00}. 

If Hq holds, then {Xi,ipi), 1 < i < 00, are also the eigenvalues/eigenfunctions of the covariance 
operators (t of the first and of the second sample. To construct a test statistic which converges 
under Hq, we can therefore pool the two samples, as explained in Section [3]] 

3. The test and the asymptotic results 

Our procedure is based on projecting the observations onto a suitably chosen finite-dimensional space. 
To define this space, introduce the empirical pooled covariance operator 9\n^m defined by the kernel 



1 f ^ - - 

— ^(Xfc(t)-X^(t))(X,(s)-X^(s)) 
I k=i 

M 



k=l 

where 

N , M 



k=l k=l 

are the sample mean functions. Let (Aj,^j) denote the eigenvalues/eigenfunctions of ^n,m, i.e. 

1 

Xi^^{t) = d^NM'Piit) = I ^7V,M(t, s)ipi{s)ds, t G [0, 1], 1 < i < iV + M, 





with Ai > A2 > . . . . We can and will assume that the (pi form an orthonormal system. We consider 
the projections 

1 

Mi) = {Xk - Xn, (pi) = j (Xkit) - XNit))^iit)dt (5) 
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and 



aUj) = {x;-x,,,^,)= / [xut)-XMit))^jm 



(6) 





To test Hq, we compare the matrices A^v and AJ^ with entries 

N 

N 



1 ^ 



and 



k=l 



M 



k=l 



We note that Afyf(i, j) — A^j(i, j) is the projection of C]\f{t, s) — C'^j{t, s) in the direction of (pi{t)Lpj[s) 
where 



^ 1 ^ - - 

CN{t, ^) = j^Yl (^'^■(*) - - ^^(^)) 



and 



k=l 



M 



k=l 



are the empirical covariances of the two samples. 

From the columns below the diagonal of Ajv — A^^ we create a vector as follows: 



vech f Aat - A%f 



I A^(l,l)-A*,(l,l) \ 
A;v(2,l)-A*^,(2,l) 



(7) 



For the properties of the vech operator we refer to Abadir & Magnus (2005). 

Next we estimate the asymptotic covariance matrix of {MN/{N + M))^/^^^ jy.j. Let 

LN,M{k, k') = (1 - Qn,m) { j^^^O'i{i)ai{j)ai{i')ai{i') - {^N^i, ^j){^N'?i' \ 
where i, j, i' ,j' depend on A;, k' (see below), and Cat (*^Af ) is interpreted as an operator with Ctv defined 



as 



'^N'?i= / CN{t,s)(pi{s)ds 
Jo 



(An analogous definition holds for From this definition it follows that 



1 ^ 



1=1 



We note that one can use L*^ j^^{k,k') instead of LN^Mik,k'), where L*j^ ^^^{k, k') is defined like 
LN,M{k,k'), but (^^]<[(pi,(pj^ and (^^\j(pi,(pj^ are replaced with if z 7^ j and Aj if i = j. In the 
same spirit, (^^^(pii, (pji^ and (€.\i(pii , (pji^ are replaced with for i' 7^ j' and A,/ if i' = j' . 
The index is computed from k in the following way: Let 



k' = ^-^^-k + l, 



p — i + 1, and j = p — j + 1. 



(8) 



We look at an upper triangle matrix (aj/j/). Then, for column j' , we have that [f — l)j'/2 < k < 
j'if + l)/2. Thus / = \j2k' + I - ^1 and i' = k' - (/ - l)j'/2, where [r] = min{A; € Z : k > r} 



for r S M. Consequently, the index can be computed from k via 



1) -2k + 



9 1 



+ 1 and i = k+ p— p-j + 



jU - 1) 



4 2 

With the above notation, we can formulate the main result of this paper: 
Theorem 1. We assume that Hq, (Op and |5p hold, and 



(9) 



[ E{Xi{t))^dt <oo, [ E{X1 
Jo Jo 



{t))^dt < 00. 



(10) 



Then 



N + M 
2 



where Xp(p+i)/2 stands for a random variable with p{p -\- l)/2 degrees of freedom. 
Theorem [T] implies that the null hypothesis is rejected if the test statistic 



exceeds a critical quantile of the chi-square distribution with p(p + l)/2 degrees of freedom. 

If both samples are Gaussian random processes, the quadratic form m-^'pIm^n M be replaced 

with the normalized sum of the squares of Ajv,M(^) j) — m(*' J)^ stated in the following theorem. 

Theorem 2. If Xi,X* are Gaussian processes and the conditions of Theorem^ are satisfied, then, 
as N,M ^ 00, 



T2 



NM 



I \ AN,M{i,j) 



A 



N + M ^ 2 

l<2,j<p 



\i A j 



» 2 

^ Xp(p+l)/2- 
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Observe that the statistic T2 can be written as 



NM 
N + M 



E 

l<i<j<p 



AAr,M(i,j) - ^ArAfl^'i)) J-^ I AAr,A/(i, i) - z) 

— - + — 



i=l 



2A? 



Next we discuss the asymptotic consistency of the testing procedure based on Theorem[T] Analogously 
to the definition of j^f define the vector ^ = (^(1), . . . , i{p{p + l)/2)) using the columns of the 
matrix 



1 1 



D 



{C{t,s) - C*{t,s))ipi{t)^j{s)dtds 







instead of A^r — A^, i.e. 
'i = vech{D). 



Theorem 3. We assume that Ha, ([ip, ([5p and (10) hold. Then there exist random variables hi = 
hi{N, M), . . . ,hp(^pj^iy2 = ^i'p{p+i)/2i^^ taking values zn{— 1,1} such that, as N,M^ 00, 

max 6v,Af(i) = op{l) (12) 

l<i<p(p+l)/2 



and therefore 



M 



(13) 



where \ ■ \ denotes the Euclidean norm. If ^ ^ and the p largest eigenvalues of C and C* are positive, 
we also have 



Ti— )-oo, as N,M ^00. 



(14) 



The assumption that the p largest eigenvalues of C and C* are positive implies that the random 
functions Xi, i = 1, . . . , N , and X* , j = 1, . . . , M, are not included in a (p— 1) -dimensional subspace. 



The application of the test requires the selection of the number p of the empirical FPC's to be used. 
A rule of thumb is to choose p so that the first p empirical FPC's in each sample (i.e. those calculated 
as the eigenfunctions of C^v and C|,/) explain about 85-90% of the variance in each sample. Choosing 
p too large generally negatively affects the finite sample performance of tests of this type, and for this 
reason we do not study asymptotics as p tends to infinity. It is often illustrative to apply the test for 
a range of the values of p; each p specifies a level of relevance of differences in the curves or kernels. 
A good practical approach is to look at the Karhunen-Loeve approximations of the curves in both 
samples, and choose p which gives approximation errors that can be considered unimportant. 
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Table 1: Empirical sizes of the tests based on statistics Ti and T2 for non-Gaussian data. The curves 
in each sample were generated according to (15). 



P==2 

Sample Sizes 1% 5% 10% 1% 5% 10% 

iV = M = 100 0.005 0.028 0.061 0.152 0.275 0.380 

iV = M = 200 0.003 0.021 0.058 0.163 0.314 0.402 

iV = M = 1000 0.002 0.021 0.056 0.190 0.313 0.426 

P==3 

Sample Sizes 1% 5% 10% 1% 5% 10% 

iV = M = 100 0.004 0.028 0.065 0.167 0.332 0.434 

iV = M = 200 0.004 0.024 0.064 0.194 0.338 0.423 

N = M = 1000 0.004 0.028 0.070 0.240 0.384 0.484 



4. A simulation study and application to medfly data 

We first describe the results of a simulations study designed to evaluate finite sample properties of 
the tests based on the statistics Ti and T2. The emphasis is on the robustness to the violation of the 
assumption of normality. We simulated Gaussian curves as Brownian motions and Brownian bridges, 
and non-Gaussian curves via 

Zn{t) = An sin(7rt) + 5„ sin(27rt) + C„ sin(47rt), (15) 

where An = SYi, i3„, = 3Y2, Cn = Y3 and Yi,Y2,Y3 are independent ts-distributed random variables. 
All curves were simulated at 1000 equidistant points in the interval [0,1], and transformed into 
functional data objects using the Fourier basis with 49 basis functions. For each data generating 
process we used one thousand replications. 

Table [1] displays the empirical sizes for non-Gaussian data. The test based on T2 has severely inflated 



size, due to the violation of the assumption of normality. As documented in Panaretos et al. (2010), 
and confirmed by our own simulations, this test has very good empirical size when the data are 
Gaussian. The test based on Ti is conservative, especially for smaller sample sizes. This is true 
for both Gaussian and non-Gaussian data; there is not much difference in the empirical size of this 
test for different data generating processes. Reflecting its conservative size, statistic Ti leads to 
smaller power than T2. We also studied a Monte Carlo version of the test based on the statistic 

N M^N Ml found that its finite sample properties were similar to those of 
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Figure 1: Ten randomly selected smoothed egg-laying curves of short-lived medflies (left panel), and 
ten such curves for long-lived medflies (right panel). 




the test based on Ti. 

We now describe the results of the application of both tests to an interesting data set consisting of 
egg-laying trajectories of Mediterranean fruit flies (medflies). The data were kindly made available 
to us by Hans-Georg Miiller. This data set has been extensively studied in biological and statistical 



literature, see Miiller &; Stadtmiiller (2005) and references therein. We consider 534 egg-laying curves 
of medflies who lived at least 34 days. We examined two versions of these egg-laying curves, the 
functions in either version are defined over an interval [0, 30], and f < 30 is the day. Version 1 curves 
(denoted Xi(t)) are the absolute counts of eggs laid by fly i on day t. Version 2 curves (denoted 
Yi{t)) are the counts of eggs laid by fly i on day t relative to the total number of eggs laid in the 
lifetime of fly i. The 534 flies are classified into long-lived, i.e. those who lived 44 days or longer, 
and short-lived, i.e. those who died before the end of the 43rd day after birth. In the data set, there 
are 256 short-lived, and 278 long-lived flies. This classification naturally defines two samples: Sample 
1: the egg-laying curves {Xj(t)(resp. li(t)), < t < 30, i = 1,2,. . . ,256} of the short-lived flies. 
Sample 2: the egg-laying curves {X*(f)(resp. Y*{t)), < t < 30, j = 1, 2, . . . , 278} of the long-lived 
flies. The egg-laying curves are very irregular; Figure [T] shows ten (smoothed) curves of short- and 
long-lived flies for version 1, Figure |2] shows ten (smoothed) curves for version 2 (both using a B-spline 
basis for the representation). 

Table [2] shows the P- values for the absolute egg-laying counts (version 1). For the statistic Ti the 
null hypothesis cannot be rejected irrespective of the choice of p. For the statistic T2, the result of 
the test varies depending on the choice of p. As explained in Section [3] the usual recommendation 
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Figure 2: Ten randomly selected smoothed egg-laying curves of short-lived medflies (left panel), and 
ten such curves for long-lived medflies (right panel), relative to the number of eggs laid in the fly's 
lifetime. 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 



Table 2: P-values (in percent) of the test based on statistics Ti and T2 applied to absolute medfly 
data. Here fp denotes the fraction of the sample variance explained by the first p FPCs, i.e. fp = 

(ELiA.)/(EtrA.). 



P-values 


P 


2 


3 


4 


5 


6 


7 


8 


9 


Ti 


82.70 


36.22 


30.59 


63.84 


37.71 


39.03 


33.77 


34.77 


T2 


0.54 


O.J 3 


O.J J 


O.J 2 


0.02 


0.00 


0.00 


0.00 




72.93 


78.36 


81.87 


83.94 


85.62 


87.08 


88.49 


89.72 
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Figure 3: Normal QQ-plots for the scores of the version 2 medfly data with respect to the first two 
Fourier basis functions. Left - sample 1, Right - sample 2. 




1 1 1 1 1 1 r 1 1 1 1 1 1 r 

-3-2-10123 -3-2-10123 



Table 3: P-values (in percent) of the test based on statistics Ti applied to relative medfly 
data; fp denotes the fraction of the sample variance explained by the first p FPCs, i.e. fp = 

(ELiA.)/(EfJi^'A,). 



P-values 


P 


2 


3 


4 


5 


6 


7 


8 


fi 


0.14 


0.06 


0.33 


1.50 


3.79 


4.53 


10.28 


fp 


33.99 


44.08 


52.72 


59.04 


65.08 


70.40 


75.29 


P 


9 


10 


11 


12 


13 


14 


15 


fi 


5.51 


2.78 


5.32 


3.21 


1.78 


6.28 


3.80 


fp 


79.91 


83.72 


86.58 


89.02 


91.34 


93.30 


95.03 
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is to use the values of p which explain 85 to 90 percent of the variance. For such values of p, T2 
leads to a clear rejection. Since this test has however overinflated size, we conclude that there is 
little evidence that the covariance structures of version 1 curves for long- and short-lived flies are 
different. For the version 2 curves, the statistic T2 yields P-values equal to zero (in machine precision), 
potentially indicating that the covariance structures for the short- and long-lived flies are different. 
The assumption of a normal distribution is however questionable, as the QQ- plots in Figure [3] show. 
These QQ-plots are constructed for the inner products {Yi,ek) and {Y*,ek), where the Yi are the 
curves from one of the samples (we cannot pool the data to construct QQ-plots because we test if 
the stochastic structures are different), and is the kth element of the Fourier basis. The normality 
of a functional sample implies the normality of all projections onto a complete orthonormal system . 
For {Xi, Ck), the QQ-plots show a strong deviation from a straight line for some projections. Almost 
all projections (Yi^&k) have QQ-plots indicating a strong deviation from normality. It is therefore 
important to apply the robust test based on the statistic Ti. The corresponding P-values for version 
2 are displayed in Table |3j For most values of p, these P-values indicate the rejection of Hq. Many 
of them hover around the 5 percent level, but since the test is conservative, we can with confidence 
view them as favoring Ha- 

The above application confirms the properties of the statistics established through the simulation 
study. It shows that while there is little evidence that the covariance structures for the absolute 
counts are different, there is strong evidence that they are different for relative counts. 



5. Proofs of the results of Section [ST] 



The proof of Theorem [T] follows from several lemmas, which we establish first. We can and will assume 
without loss of generality that /i(t) = /x*(i) = for all t G [0, 1]. 
We will use the identity 




(16) 



fc=i 



1 



N 



Ari/2 



Y,Xk{t)Xk[s) - N 



'^XN{t)XN{s) 



k=l 



and an analogous identity for the second sample. 

Our first lemma establishes bounds in probability which will often be used in the proofs. 



Lemma 1. Under the assumptions of Theorem 



[I[ as N,M ^ 00, 




(17) 



N^'^XN{t)\\=Op{l) 



(18) 
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and 



M 



M-'/'J2™t)XUs)-C*{t,s)} 



k=l 

M'/^X*M{t)\\=Op{l). 

Proof. First we note that 
.1 .1 r N 



Op{l) 



(19) 
(20) 



E / — V{Xfc(t)Xfc(s) -C7(t,s)} dtds= I I E{Xi{t)Xi{s)-C{t,s)fdtds 
Jo Jo \_N2 i^^^ J Jo Jo 



so, by Markov's inequality, we have 

N 



-\y2{Xk{t)Xk{s)-c{t,s)} 



k=l 



Op(l). 



Similar arguments yield (18) - (20). 



□ 



The next lemma shows that the estimation of the mean functions, cf. the definition of the projections 
ak{i) and a^(j) in (jsj) and Q, has an asymptotically negligible effect. 

Lemma 2. Under the assumptions of Theorem^ for all 1 < i, j < p, as N,M ^ oo, 

1 ^ N 



and 



k=l 



M 



k=l 



Proof. Using (16) and (18) we have by the Cauchy-Schwarz inequality, 



1 fi 



JO 



N^/'^X N{t)X Nis)lpi{t)lpj{s)dt ds 



N'^/'^XN{t)(pi{t)dt 







N'^''^XN{s)(pj{s)dt 



dt / 0i{t)dt 



JO 

= iV-i/2 y {N^/^X^{t)fdt 
= Op . 
The second part can be proven in the same way. 







N^'^Xn{s)^'^ ds ^ 0]{s)d 



1/2 



□ 



We now state bounds on the distances between the estimated and the population eigenvalues and 
eigenfunctions. These bounds are true under the null hypothesis, and extend the corresponding one 
sample bounds. 
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Lemma 3. // the conditions of Theorem^ are satisfied, then, as N,M ^ oo, 
max lAi - Ail = Op ((N + M)-^/^ 

l<i<p \ 



and 



where 



max ll^j -CiifiW = Op ({N + My 

l<i<p \ 



1/2 



Ci = Ci{N,M) = sign(((^j,(y9j)). 



Proof. It follows from (|17|) - (|20|) and the assumption C = C* that 

Rn,m — C 



Op + j /{N + M) 

and since N^^"^ + M^^'^ < 2(N -\- M)^^"^ , the result follows from the corresponding one sample bounds, 



see e.g. Chapter 2 of Horvath & Kokoszka (2011+ ). 



□ 



Lemma [3] now allows us to replace the estimated eigenfunctions by their population counterparts. 
The random signs q must appear in the formulation of Lemma |4j but they cancel in the subsequent 
results. 

Lemma 4. // the conditions of Theorem^ are satisfied, then, for all 1 < i, j < p, as N,M ^ oo, 
NM \ 

AN{i,j)-Alj{i,j) 



N + M 



Jj: ^{Xk,Ciipi){Xk,Cjipj) - — ^{Xl,CrLpi){Xl,Cjipj) \ + Op(l). 
■ ' k=l ) 



N + M 



k=l 



Proof. We write 

N 

N 



hy2(^k,<ft){Xk,^j) - [ [ C{t,s)(pi(t)(pj{s)dt ds 
V Jq Jo 

= ivV2 I _L_ ^ {Xk{t)Xk{s) - C{t, s)) I (pMH^)dt ds. 

Using Lemmas [T] - [3] we get 

Jo |iVV2 ^ {Xkit)Xk{s) - C(t,s)) I {lpi{t)ipj{s) -Ciipi{t)cjipj{s))dtds 
I' l'[-WU-2 E iXk{t)X,{s) - C{t, s)) I 



X {{^iit) -CiLpi{t))(pj{s) + cnpi{t)[(pj{s) -Cj(pj{s))] dtds 
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< 



+ 



\ 1/2 

(pi{t) -Ciipi{t)) 0^j{s)dtds\ 



10 Jo 



1 fl 



1/2 



X / / ^'iit){'Pjis) -Cj^pjis)) dtds 



10 JO 

N 



iVl/2 



fc=l 



{ ll^j - Ci(/?j|| + W^pj -Cj(pj\\ } 



= op(l). 

Similar arguments give that 
.1 .ij^ 




A/ 



io |mV^^(^^^*^^^*^''^"^*^*'''^)| {^iW^j(^)-c*'/'*(i)ci^i(s)}dtds 



Op(l). 



Since C = C*, the lemma is proven. 



□ 



The previous lemmas isolated the main terms in the differences A]\f{i,j) — A^^(i,j). The following 
lemma describes the limits of these main terms (without the random signs). 

Lemma 5. // the conditions of Theorem^ are satisfied, then, as N,M ^ oo. 



V 



{AN,M{i,j)A <'i',j <P} — > {A(z,j),l < i,j <p}, 



where 



N M '\ 

k=l k=l ) 



and {A(i,j), 1 < i,i < p} is a Gaussian matrix with EA{i,j) = and 

EA{i,j)A{i',j') = {l-e){E{{Xi,ip^){Xi,^j){Xi,^i,){Xi,^,,)) 

- E{{Xi, ipi){Xi,^j))E{{Xi, ^i,){Xi,^,,))} 
+ Q{E{{Xl,ip,){Xl,^,){Xl,ip,,){Xl,^,,)) 

- E{{Xl,cp,){Xl,ipj))E{{Xl,^e){Xl,ipj>))} . 

Proof. First we note that 



E{Xi,^i){Xi,ipj) = E {XI, ^i,) {XI, if J, 



if i^j, 
Xi if i = j. 



Since E({Xi, (fi) {Xi, ipj)^'^ < oo and E(^{X^,ipi))X^,Lpj)^'^ < oo, the multivariate central limit theo- 
rem implies the result. □ 
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Finally, we need an asymptotic approximation to the covariances L7v,m(^) k'). Let 

N 

N 



LN,M{k,k') = (1 - Qn,m) \ jj^(^fXi)(^e{j)0'e{i')ae{j') - (€N^i,^j){^N'fi',^j') 



where 

ae{i) = {Xi,ipi) and a}{{} = {X^,ipi), 
and are determined from k and k' as in ([8| and (joj). 

Lemma 6. // the conditions of Theorem^ are satisfied, then for all 1 < k,k' < p{p + l)/2, 

LN,Mik, k') — CiCjCi'Cji Lj\f^M{k, k') = op{l) as N, M ^ oo, 
where {i,j) and {i',j') are determined from k and k' as in (jsj) and (joj). 

Proof. The result follows from Lemma [3] along the lines of the proof of Lemma |4] □ 

Proof of Theorem [TJ According to Lemma [2] and Lemmas |4] - [6j the asymptotic distribution of 
M-^N^M^N,]^ does not depend on the signs ci,...,Cp, so it is sufficient to prove the result for 
ci = . . . = Cp = 1. The law of large numbers yields that 

LN,M{k,k')-^L{k,k'), (21) 

where 

L{k,k') = (l-e){ii;(ai(i)ai(i)ai(Oai(jO) -ii^(ai(i)ai(i)ai(Oai(/))} (22) 
+ G {i?(aKz)aI(jK(0«*i(/)) - i?(aK^)«I(jK(Oa*i(j')) } ■ 

Now the result follows from Lemmas |2] |4] and |5] □ 

Proof of Theorem [21 We continue to assume that fj, = fi* = 0. This means that, under Hq, 
Xi, . . . ,Xi\l,Xl, . . . jX'^j are independent and identically distributed Gaussian processes. Hence 

1 /2 1/2 

{Xi,ipi) = X- Ni, {Xi,ipj) = X- N*, where Ni,N*,l < i,j < p are independent standard nor- 
mal random variables. We have already pointed out that 

E{Xi,^i){Xi,ipj) = 



if i^j, 
Xi if i = j, 



and 

E{Xi,ipi'){Xi,ipj') 



if z'^f, 
Xi' if i=j'. 
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li i = j = i' = j' , then 

E{{X,,V.d)' - {E{{X,,^,))')' = X^E{N^ - (ENff) =2X1 
li i = i' and j = f (i ^ j), then 

E{Xi,(pi){Xi,ipj){Xi,ipi'){Xi,ipj/) = XiXj. 
In all other cases 

E{Xi,(pi){Xi,ipj){Xi,ipi'){Xi,ipj/) = 0. 
Hence A{i,j), 1 < i < j < p, are independent normal random variables with mean and 

[ 2Xf if i = j. 
Now the result follows from Lemmas [T] - [5j 

Proof of Theorem |3l First we observe that by the law of large numbers we have 



□ 



1 1 



j j {RN,M{t,s) - R{t,s)fdtds = Op{l). 




Hence using the result in section VI. 1. of Gohberg et al. (19901 (cf. Lemmas 2.2 and 2.3 in Horvath 

(23) 



fc Kokoszkal ( |2011+[ )) we get that 

Op(l) 



max 

l<i<p 



A-; — A-; 



and 



max -cupiW = op{l), 

l<i<p 



(24) 



1 1 



where Cj = Ci{N,M) = sign((^j, c^j)). Relations (23) and (24) show that Lemma |3] remains true. It 
follows from the law of large numbers and (24) that for all 1 < i,j <p 

1 1 

A7v(i, j) - A^/(i, j) - CiCj j j iC{t, s) - C*{t, s)) ^^{t)^j{s)dtds 



1 1 

(^CN{t, s) — Cl,j{t, s)^ (pi{t)(pj{s)dt ds — CiCj J J {C{t, s) — C*{t, s)) (pi{t)ipj{s)dt ds 



'CNit, s) - C{t, s) - (dl,{t, s) - C*{t, s))) ^i{t)ip,{s)dt 
{C{t, s) - C*{t, s)) {(pi{t)(pj{s) - Ciipi{t)ciipj{s)) dtds 





1 1 




ids 





1 1 
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< 



Cn — C 
op{l), 



+ 



+ ||C - C*|| \\^i(pj - CiipiCiifjl 



where the fact that \\(pi\\ = 1 = ||^i|| was used. Hence the proof of (12) is complete. It is also clear 



that (12) implies (13). 



Next we observe that Lemma |6] and (21) remain true under the alternative. Now by some lengthy 



calculations it can be verified that L given in ( 22 ) is positive definite so that ( 14 ) follows from ( 13 ) . □ 
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